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Abstract. For a random graph on n vertices where the edges appear with individual rates, we 
give exact formulas for the expected time at which the number of components has gone down to 
k and the expected length of the corresponding minimal spanning forest. 

For a random bipartite graph we give a formula for the expected time at which a fc-assignmont 
appears. This result has bearing upon the random assignment problem. 



1. Introduction 

The study of random graphs, mitiated by Erdos forty years ago, has been preoccupied by asymptotic 
results. In this paper, we will derive exact enumerative formulas in some cases where previously 
only asymptotics were known. 

By a random graph G{t) on n vertices we mean a stochastic process starting with an edgeless 
graph &i t — and where edge after edge appears at random times. We take these times to be 
independent exponentially distributed variables, so the probability of edge not having appeared 
at time t is exp(— a^jt). The rates aij may be viewed as labelling the edges of if„ and any nonnegative 
edge labelling specifies such a random graph process. 

The number of graph components at time i is a thoroughly studied statistic and asymptotics for 
its expectation and distribution are known in many cases (see Janson Of particular importance 
is the time T„ at which the graph becomes one component, i.e. gets a spanning tree, and also the 
minimal length L„ of such a tree, i.e. the sum of its edge times. (By Kruskal's algorithm the first 
tree has indeed minimal length.) For the simplest case when all rates are 1, there is the following 
beautiful result by Frieze. 

£rnE(i„)=C(3) = l + ^ + ^ + ^ + --- = 1.202... 

The first published version of the above result seems to be by Fenner and Frieze where the 
distribution used was the uniform distribution on [0, 1]. Later Frieze |S] showed that the same result 
holds for any distribution function F such that F'{Q) exists and equals 1. The Exp(l) distribution 
is an obvious example. 

In the first theorem of this paper, we will give simple exact formulas for the following more general 
statistics. 

E(T„(fc)) = E(i I G{t) becomes a fc-componcnt graph) 
E(L„(fc)) = E(length of the minimal spanning fc-forest) 

Note that our formulas work for arbitrary rates aij . The special case where all = 1 turns out to 
be a sum over all integer partitions of n fCorollarv ll.2|l . In the general case we will use the following 
notation. Let the rates label the edges f„ of K^- For an edge G £^„, let [cij] = a^j and for 
a subset A C £„ of edges, let [A\ denote the rate sum X^e sa'^'j- ^ subset A C £■„ is a clique if 
it spans a complete subgraph. Any partition of the vertices of Kn into j parts defines a subgraph 
consisting of j cliques. If B is the edge set of such a subgraph, we write B G Clique^. 
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Theorem 1.1. For a random graph G{t) on n vertices with rates a^j 

r{j, k) 



E(T„(fc)) 
E(L„(fc)) 



E E 

j^fc+i _BeCliquc^ 

71 

E E 



[£n - B] 
[£n - B] 



where T{j, k) ~ X]i=fc+i ^(i' *) '^"■'^ '^(Jj ^) — Ei=fc+i (* ~ k)s{j, i) are sums of signed Stirling numbers 
of the first kind. 
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Table 1. The recursion s{j + l,i) = s{j,i — l) — j s{j,i) is shared by t and A 



For example, A'3 has three subgraphs in Chque2 and one in Chque3, so 

E(r3(i)) = — ^ — + — ^ — + — ^- ^ ■ 

ai2 + ai3 012 + 023 ai3 + 023 ^12 + ai3 + ^23 
If we put all Oij = 1 and also fc = 1, we should get an expression that tends to Frieze's result when 
n goes to infinity. We use the notation (j^ei ) ^^r the multinomial number (j\Yi(2\Y2 — • 

Corollary 1.2. For a random graph G{t) on n vertices with rates 1 

= (l'=l,2=2,...) (ei,e2,...) f (n^ - (l^ei +2262-^... )) 

E(i„(l)) = J2 (l = l,2"2,...)(ei,r2,...) ( = )(„2-(l2ei+22e2+...)) 

where the sums are over integer partitions of n into two or more parts, i.e. nonnegative integer 
sequences (e^) such that e = ei -|- 62 -|- • ■ ■ > 2 and lei + 2e2 + • • • = n. 

For example, E(L3(1)) = (1^2) ( A)/© (9 " 1 "4) - d j.J (g)/© (9 -3) = |. Clearly, the terms of 
the formulas grow exponentially with n. It is an intriguing consequence of Frieze's theorem that, 
nevertheless, the last sum must tend to (^(3) as n ^ 00. 

1.1. Random assignment. Our reason for studying this problem was to get a grip on the random 
assignment problem which is the study of minimal assignments in a random bipartite graph G{t) C 
Km,n- As before, the time of appearance of edge e.y is assumed to be exponentially distributed with 
rate and we consider these rates as labelling the edges of the complete bipartite graph Km,n- 
A ^-assignment is a disjoint set of k edges. The following two statistics are of special interest. 

E(Tm„(fc)) = E(t I G{t) obtains a fc-assignment) 

E(Lmn(fc)) = E (length of the minimal ^-assignment) 

The analogy with the previous definitions is deceptive, for here the fc-assignment of Tmnik) is not the 
^-assignment of Lmn{k) in general! Since the length of an assignment is the sum of the appearance 
times of the edges involved, the first /c-assignment to be completed need not have the minimal 
length. Because of this, the computation of E(L„i„(/c)) seems to be quite difficult and we have 
obtained formulas only for the simplest cases. 
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Figure 1. The four tableau shapes in Tabloid2 



For k = m = n, we have the original random assignment problem. The following conjecture, 
proposed by Parisi [Sj in 1998, has been verified up to n = 7. 

Conjecture 1.3. E(L„„(n)) = 1 + 1/4 + 1/9 H h l/ji^ 

Note that this expression tends to C(2) as n ^ oo. This asymptotic conjecture, stated by Mczard 
and Parisi in 1985 [7], was proved by Aldous p. 

Generalizations of the conjecture to general k, m, n have been proposed by Linusson and Wastlund 
in] and by Buck, Chan and Robbins [2], who also consider more general rates Uij. As far as we know, 
however, formulas for arbitrary rates have not been obtained before. 

For the expected time until the first fc-assignment appears, our method gives the following formula. 



Theorem 1.4. 



E(T™„(fc)) 



E 



Sn 



B] 



SeTabloidfc 

The numerators S„i^n,k{B) are certain products of binomial coefficients to be defined in Section 3. 
A proper edge subset B is called tabloidal if the matrix entries G _B fill a tableau shape (Young 
diagram), possibly after permutation of rows and columns. It is in Tabloid^ if the inner corners of 
this tableau satisfy i + j < k and the outer corners i + j > k. 

With k = 2, for example, four tableau shapes arc involved and the expression becomes 

(--1) (-1) _ ^ 1 ^ ^ 1_ 



E(T,„„(2)) 



[£n 



E 



1 



- coL 



If we try to compute L„in{k) as Tmn(l) + T'mn(2) + • • • + T„in(k) we get a value that is too small, 
since the subassignments of the assignment of minimal length need not be minimal for T„j„. For 
small k we have been able to calculate E(L„„(A:)), for example 

ay[rowi - eij][colj - e^] 



E(L™„(2)) = E(r„„(l)) + E(T,„„(2)) + J2 



{£m,n\ {£ii 



j] [£n 



rOW^] [£jn,n - Colj 



The derivation of this expression and similar ones will appear in our forthcoming paper 

2. Minimal spanning fc-FORESTS 

This section is devoted to the proof of Theorem 11.11 The formula to be proved is of the form 
E(T„(fc)) = ^cb/[£'„ — B] summed over all edge sets B (the constant cb being zero for most B). 
Now, the terms on the right-hand side can be interpreted according to an elementary property of 
exponentially distributed variables: the expected minimum of a set of such variables is the reciprocal 
of the total rate. In our case, this means E(minimal appearance time of an edge in A) ~ ^/[A]. 

In view of this, we are going to prove a derandomizcd version of the formula, obtained by removing 
the expectation operator. 



(1) 



Tn{k) = ^ CB niin(£„ - B) 



In this version, Tn{k) is no longer a stochastic variable. The graph G{t) may have been constructed 
by any process, random or nonrandom, with edge e^- appearing at time tij and by min{£„ — B) we 
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mean min(tij | e^j ^ B). Thus wc have the problem of proving a purely combinatorial formula valid 
for any set of numbers tij . 

In order to compute the time Tn{k), we will study the Boolean lattice of all subsets of edges. 
Each subset A C 5„ of edges determines a subgraph (V„, A) of Kn, where all n original vertices are 
still present. Let k{A) denote the number of connected components of this subgraph. 




Figure 2. The Boolean subgraph lattice of partitioned according to the number 
of components. 

When edges appear, one at a time, we follow a path upwards in the Boolean lattice from the 
empty set to the complete edge set. At some time T'„(fc), the path reaches a set A where k{A) = k. 
The first part of the following lemma simply states that this time is the sum of the time intervals 
spent in each node on the upward path. 

Lemma 2.1. 

(2) T„(fc)= ^(^•')' 

K{Ai)>k 

where Ai is the set consisting of the first i edges (so Aq = and T(Ai) is the time spent in 
configuration Ai until another edge appears. For any edge set A, the expression 

(3) = ^ (-1)1^-^1 min(£:„ -i?) 

BCA 

evaluates to T{Ai) if A is an Ai and to zero otherwise. 

Proof. Let i = \A\, let Ci be the last edge in A to appear and let ti be its time of appearance. For 
every B (- A, let B be the result of toggling e,;, that is either including or deleting it. In most cases, 
the contributions of B and _B in (O will cancel, for ti will seldom be a minimum. The only case for 
which min(£'„ — B) ~ ti is when all earlier edges are in B. But then B ~ A consists of the first i 
edges, that is A = A.;, and we have min(f„ — B) — min(£'„ — B) = t^+i — ti = T{Ai). Otherwise, all 
terms arc cancelled by the toggling. □ 

Combining the two formulas in Lemma l2.1l we obtain the following expression for the time at which 
the number of components of the graph G{t) has gone down to k: 

(4) r„(fc)= E(-i)'^"'" ™^(^"-^)= E E (-i)'^""" 

K{A)>kB<ZA BC£„ BCA, K{A)>k 

We next observe that the last sum is zero for many sets B. For suppose that B has some component 
that is not a clique (a complete subgraph). In other words, there exists some edge e that we can 
add to B without changing the number of components of B. This property is obviously shared also 
by all supersets A oi B where e is not already present. Pair off every such A with A + e. The pair's 
contribution to the sum is (— l)'"^^-^' + (— = 0. The entire sum is over a disjoint collection 
of such pairs, so it is zero too. 
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Wc have now reduced the sum to sets B where all components are chques. If there are j com- 
ponents, we write B G Chque^. It turns out that each chque can be contracted into a node of 

Lemma 2.2. If B ^ Chque, , the following equality of generating functions holds: 



y: i~ir~'^x^^^^ = E (-1) 

BCAC£„ A'C£. 



Proof. Fix j and argue by induction over n. The equahty is triviaUy true for n = j (in which case 
B must be the empty set). Suppose it holds for n ~ i > j. Now study n = i + 1. Some clique has 
at least two vertices, say x and y. We shall see that omitting vertex y does not affect the sum. 

If A is an extension of B including an edge from y to a vertex z outside the clique, then toggle edge 
xz to obtain A (that is, the only difference between A and A is whether the edge xz is included or 
not). Obviously k(A) = ni-A) so they both contribute to the same term in the generating function. 
Their joint contribution is zero. Hence, the existence of vertex y can be disregarded in the sum over 
edge sets A that are extensions of _B, so we are down to i vertices again. □ 

The right-hand generating function of (|2.2|l can be computed explicitly. It can be derived as a 
special case of the prime example of the exponential method in Stanley's book |10| . but we prefer 
to give a direct proof. 

Lemma 2.3. 

^ (_l)l^'l^«(^') ^ a;(x - l){x -2)---{x-j + 1). 

(This is the generating function X]i=i *)^* /''^ signed Stirling numbers of the first kind.) 

Proof. We argue by induction on j. The formula is true for j = 1. For j > 1 vertices, all A' 
such that at least two edges involve the jth vertex can be combined into zero-contributing pairs by 
toggling the edge forming a triangle with the lexically first pair of such edges. Hence, we need only 
consider A' where the jth vertex is involved in zero edges (one possibility, decreasing the number 
of components by one if removed), or one edge {j — 1 possibilities, changing sign if removed), thus 
resulting in the factor {x ~ j + 1) times the generating function for j — I vertices. □ 

By the previous two lemmas we can compute the sum for B G Clique^ as a sum of Stirling 
numbers: 

E (-1)'-^-^'= E E ^a^)- 

BCAC£„, K(A)>k A'C£j. K{A')>k i=k+l 

We can now plug this into 10} and obtain 

n j 

Tn{k)^ ^ ^ s{i,i) ^ min(f„-S), 

j=k+li=k+l BGClique^- 

thereby proving the first formula of Theorem ll.il 

2.1. Special cases. The spanning tree special case (i.e. fc = 1) can be simplified further. Since the 
sum of all Stirling numbers s{j,i) for a fixed j > 1 is zero, we have 

E^O'*) = -^OM) = (-i)'0'-i)'- 

i=2 

(Recall that the signed Stirling numbers of the first kind have the combinatorial interpretation that 
s{j,i) is (—I)-'"'"* times the number of permutations in Sj with i cycles.) 
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Specializing to all rates equaling 1, every partition B of /v„ into cliques of given sizes con- 
tributes equally. With ei cliques of size 1, 62 cliques of size 2, etc., the number of possibilities for B 
is 



(l!)'^iei!(2!)'--2e2!...' 

and the rate reciprocal l/[£n — B] is 2/{n^ — (l^ei + 2^62 + ...)). Let e = ei + e2 + ■ ■ ■ denote the 
number of cliques. We must multiply everything by (— l)'^(e — 1)! = {—lyel/e. Thus we obtain 

n \fe\ {-ly 



^ (p^2«^...) (ei,e2,...) 



(7l2-(l2ei+ 2262 + ...))' 



where the sum is taken over all 61,62, . . . such that lei + 262 + • ■ • = n. This proves the first 
statement of Corollarv ll.2l 

2.2. Expected length of the minimal spanning /c-forest. By Kruskal's algorithm for spanning 
trees, the minimal spanning fc-forest in Kn consists of the longest edges of the first spanning r-forests 
for r = n, n — 1, n — 2, . . . , fc. 

Thus, the expected length of the minimal spanning fc-forest in Kn is 

r=k r=k j=r+l i=r+l SsClique^ " 

n j i-1 ^ 

= E E E*(^'*) E [g 

j=k+l i=k+lr=k SGCliquCj ^ " 

proving the second half of Theorem ll.il The second half of Corollarv ll.2l is proved in the same way 
as the first half. 

3. Expected time to a fc-ASSiGNMENT in K^.n 

In this section we study a random bipartite graph G{t) C Km,n- Instead of trees, we are waiting for 
a fc-assignment (a collection of k disjoint edges) to appear. Let ^,{A) denote the size of the maximal 
subset of disjoint edges in an edge set A C £rn,n- 

As edges appear in Git) we follow a path from the empty set and upwards in the Boolean lattice, 
until we reach a set A where /x(j4) = fc. Let Tmn{k) be the waiting time. Precisely as in the previous 
section, we have 

(5) E(T„„(fc))= P(^)E(rA). 

Ai(A)</c 

As before, we have E(Ta) = l/[£m.n — A\ and obtain 

(6) P(^)E(r^) = I _Rr 

BCA ^ 

and finally 

(7) E(r„„,(fc))= Y re E (-l)'^"""- 

SC£,„_„ ' BCA, fj,{A)<k 

Hence, we have reduced the problem of computing E(T„j„(fc)) to a sum over edge sets B where 
each term is a fraction. The denominators are simple, but the numerators are sums over supersets 
A D B that do not contain a fc-assignment: 

(8) ^™,„,fe(B):= Y (-l)'^"""' 

BCA, ^i{A)<k 
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which turns out to be much more comphcated than in the previous problem. Note that S'„i_„ jt(0) = 
if A; = (no terms in the sum) and also if fc > ?ti or fc > n (toggle any edge to see that the terms 
cancel). 

An edge set B can be identified with a subset of the entries of an m x n matrix. Note that 
the existence of a /c-assignment is independent of the order of rows and columns. We say that a 
proper subset B is tabloidal if the rows and columns can be reordered to make B tableau shaped, 
i.e. consisting of all entries above a lattice path. It turns out that only tabloidal B give nonzero 

Lemma 3.1. Sm.n.k{B) ^ only for tabloidal edge sets B . 

Proof. It is easy to see that B is nontabloidal if and only if its set of matrix entries contains the 
pattern ^ . For some of the supersets Am JHJ, the upper right square is filled, and these cancel the 
terms where this square is removed (the toggling trick). We are left with A such that the inclusion 
of the upper right square would complete a fc-assignment (that clearly doesn't involve the lower left 
square). Since //(A) = fc — 1, Konig's theorem tells us that it can be covered by fc — 1 rows and 
columns. The upper right square isn't covered so the lower left square must be covered. Therefore 
it can be toggled and so everything cancels out. □ 

Not even all tabloidal edge sets B contribute to the sum, only those in Tabloid^ , that is such that 
the inner corners of the tableau lie above the /c-diagonal and the outer corners lie below or on it (see 
Fig. El- For such a tableau the numerator Sm,n.k{B) is determined by the empty mid rectangles 
at the inner corners, defined by extending the vertical and horizontal lines. Let the size of the ith 
rectangle be rui x rii and note that B e Tabloidfc means that the fc-diagonal intersects the rectangle, 
defining a local fci-diagonal. 




ki 



1^2 
I 1 



mi 



m2 



n ni 712 

Figure 3. The decomposition into mid rectangles. 

Lemma 3.2. With the notation of the figure, the following formula is true for any tabloidal B ^ %. 

I 



Sm,n,k{B) = (-l)'+'n^" 



Note that the product will be nonzero only if all ki > (inner corners above the k-diagonal) and all 
ki < TUijUi (outer corners below or on the k-diagonal). 

Proof. We must count extensions of B not containing a fc-assignment. Sm.n.k{B) is the number of 
even legal extensions minus the number of odd legal extensions. 

First we show that it is enough to consider extensions obtained by filling in entries in the mid 
rectangles only. Let U be the set of entries not in these rectangles nor in B. To see that the 
contribution of all extensions with some fixed entry in U is zero we invoke Lemma l3.1l This argument 
can be repeated for all entries in U and, by the inclusion-exclusion principle, the total contribution 
of extensions with at least one entry in U is 



E 



Sm,n.kiB + V)=0. 
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Then we show that an extension contains a fc-assignment if and only if the tth mid rectangle 
contains a fci-assignment for all i. Suppose that the latter statement is true. Then rows and columns 
can be permuted such that the dashed fc-diagonal is covered, thus producing the fc-assignment. 
Conversely, if the filled entries of the ith mid rectangle can be covered by less than ki rows and 
columns, then together with the rows above and the columns to the left we have a covering of all 
filled entries by less than k rows and columns. 

Finally we note that the signed sum of extensions not containing a fc-assignment is equal to minus 
the corresponding signed sum of extensions containing a fc-assignment, since = where 

the sum is taken over all subsets A of the union of the mid rectangles. Thus we get that 

/ 

□ 

Now, we are finally able to complete Theorem II. 41 



Proposition 3.3. In the formula 



SeTabloidfc ^ ™'" ' 



the numerators are given by 

h - 1 



i=l 



where rui, Ui, ki refer to the I mid rectangles at the inner corners of the tableau B (see Fig.\^. 
Proof. The only thing left to show is that 



5, 



m — 1\ /n — 1 
fc-1 jU-1 



and we will prove this by induction. The statement is trivial for m = n = \ and for fc = 1. Our 
induction assumption will be that the statement is true for all values of m, n, fc smaller than the 
current ones (with at least one strict inequality). 

We regroup the terms in Sm^n.ki^) according to the number of filled entries in the first row. Let 
Sq denote the contribution from the legal extensions where the first row is completely empty. By 
inclusion-exclusion we get 

ycrowl 

and by Lemma |3. 21 all terms in the sum vanish except for V = $ (the term we want to compute), 
\V\ = n (the whole row filled) and \V\ = fc — 1 (and there are {f,^^^ such terms). It is obvious how 
to reduce the cases of an empty row and of a filled row, so we get 

5'm,n,A:(0) = 'S'm_i,„.fc(0) — (—l)<S'„i_i^fc-l, fc-1 (0)5*1, „_A;+14 (0) — S'm_i,„,fc_l (0) 

m - 2\ fn -l\ / n \ /m-2\/k- 2\ /0\ /n - k\ /?n - 2\ /n - 1 
fc - ij U - ij ^ Vfc - 1/ U'- 2 j U - 2/ VO/ V )^\k-2)\k-2 

fc - 1 j U - 1 

This completes the proof. □ 
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4. Conclusion 



In this paper we succeed in finding formulas for E(T„(fc)), the expected time at which a spanning 
fc-forest has appeared, for E(L„(fc)), the expected length of this minimal fc-forest, and for E(r,„„(fc)), 
the expected time at which a fc-ixiatching appears. However, we fail to find a formula for E(i,„„(fc)), 
the expected length of a minimal fc-matching. Why is this? 

The answer emerged in Sec|21 in which we found that the first three statistics shared an important 
property . Given the actual times Uj at which the edges appeared, it is possible to express T„(k), 
Ln{k) and Tmn{k) as sums of terms of the form min(A), by which is meant the minimal time in the 
edge set A. 

It is clear that many other statistics can be expressed analogously, thereby giving rise to similar 
expectation formulas. However, the minimal length assignment Lm„(fc) cannot be expressed by 
subset minima in this way. We have been able to derive formulas for L„in{k) for small k (see [21) 
and already fc = 2 gives a counterexample. 



It is easy to verify that this rational function cannot be rewritten with linear denominators. 
Therefore, Lmn{k) is not expressible as a linear combination of minima. 
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